Models for automatic classi cation of video
نویسنده
چکیده
In this paper, we explore a technique for automatic classiication of video sequences, (such as a TV broadcast, movies). This technique analyses the incoming video sequences and classiies them into categories. It can be viewed as an on-line parser for video signals. We present two techniques for automatic classiication. In the rst technique, the incoming video sequence is analyzed to extract the motion information. This information is optimally projected onto a single dimension. This projection information is then used to train Hidden Markov Models (HMMs) that eeciently and accurately classify the incoming video sequence. Preliminary results with 50 diierent test sequences (25 Sports and 25 News sequences) indicate a classiication accuracy of 90% by the HMM models. In the second technique, 24 full length motion picture trailers are classiied using HMMs. This classiication is compared with the internet movie database and we nd that they correlate well. Only two out of 24 trailers were classiied incorrectly.
منابع مشابه
News Video Classi cation based on Semantic Attributes of Captions
As a basis for automatic indexing to video data based on shot classi cation, we will present a graphical classi cation rule acquisition method based on semantics of accompanying natural language data i.e. captions. A preliminary experiment to actual television news programs showed good correspondence between graphical characteristics and semantic attributes of captions.
متن کاملHarmonium Models for Video Classi cation
Accurate and e cient video classi cation demands the fusion of multi-modal information and the use of intermediate representations. Combining the two ideas into the one framework, we propose a series of probabilistic models for video representation and classi cation using intermediate semantic representations derived from multi-modal features of video. Based on a class of bipartite undirected g...
متن کاملRobust gender-dependent acoustic-phonetic modelling in continuous speech recognition based on a new automatic male/female classification
In this paper we present a new automatic male/female classi cation method based on the location in the frequency domain of the rst 2 formants. This classi cation is based on a new automatic formant extraction which is faster than a peak picking technique. Gender-dependent acoustic-phonetic models stemming from this classi cation are used in the INRS Continuous speech recognition system with ATI...
متن کاملA Cascade Visual Front End for Speaker Independent Automatic Speechreading
We propose a three-stage pixel based visual front end for automatic speechreading (lipreading) that results in signi cantly improved recognition performance of spoken words or phonemes. The proposed algorithm is a cascade of three transforms applied on a three-dimensional video region-of-interest that contains the speaker's mouth area. The rst stage is a typical image compression transform that...
متن کاملAudio-Visual Speaker Veri cation using Continuous Fused HMMs
This paper examines audio-visual speaker veri cation using a novel adaptation of fused hidden Markov models, in comparison to output fusion of individual classi ers in the audio and video modalities. A comparison of both hidden Markov model (HMM) and Gaussian mixture model (GMM) classi ers in both modalities under output fusion shows that the choice of audio classi er is more important than vid...
متن کاملAudio-video biometric recognition for non-collaborative access granting
In this paper, the problem of non collaborative person identi cation for a secure access to facilities is addressed. The proposed solution adopts a face and a speaker recognition techniques. The integration of these two methods allows to improve the performance with respect to the two classi ers. In non collaborative scenarios, the problem of face recognition rst requires to detect the face pat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997